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Abstract 

We introduce one-way games, a framework motivated by applications in large-scale power 
restoration, humanitarian logistics, and integrated supply-chains. The distinguishable feature 
of the games is that the payoff of some player is determined only by her own strategy and 
does not depend on actions taken by other players. We show that the equilibrium outcome in 
one-way games without payments and the social cost of any ex-post efficient mechanism, can 
be far from the optimum. We also show that it is impossible to design a Bayes-Nash incentive- 
compatible mechanism for one-way games that is budget-balanced, individually rational, and 
efficient. To address this negative result, we propose a privacy-preserving mechanism that is 
incentive-compatible and budget-balanced, satisfies ex-post individual rationality conditions, 
and produces an outcome which is more efficient than the equilibrium without payments. The 
mechanism is based on a single-offer bargaining and we show that a randomized multi-offer 
extension brings no additional benefit. 


1 Introduction 


When modeling economic interactions between agents, it is standard to adopt a general framework 
where payoffs of individuals are dependent on the actions of all other decision-makers. However, 
some agents may have payoffs that depend only on their own actions, not on actions taken by 
other agents. In this paper, we explore the consequences of such asymmetries among agents. Since 
these features lead to a restricted version of the general model, the hope is that we can identify 
mechanisms that produce efficient outcomes by exploiting the properties of this specific setting. 

A classic application of this setting is Coase’s ex ample of a polluter and a single victim, e.g., a 
steel mill that affects a laundry. The Coase theorem ( IQBfll ) is often interpreted as a demonstration 
of why private negotiations between polluters and victims can yield efficient levels of pol lution 
without government interference. However, in an influential article, Hahnel and Sheeran ( 20091 ) 
criticize the Coase theorem by showing that, under more realistic conditions, it is unlikely that 
an efficient outcome will be reached. They emphasize that the solution is a negotiation, and 
not a market-based transaction as described by Coase. As such, incomplete information plays an 
important role and game theory and bargaining games can explain inefficient outcomes. 

Other real-life application s are found in larg e -scale restoration of i nterd ependent infrastructures 


after significant disruptions Cavdaroglu et al . 201,4 Coffrin et al . 2012 ]. humanitarian logistics 


An earlier, shorter version of this paper appeared in Proceedings of the Twenty-Fourth International joint con¬ 
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over m ultipl e states or regions [Van Hentenrvck et all l201[ll |. supply chain coordination (see, e.g., 
Voigt (l2nil )). integrated logistics, and the joint planning and the control of gas and electricity 
networks. Consider for example the restoratio n of the power system a nd the telecommnnication 
network after a major disaster. As explained in lCavdarogln et al\ [201.'ll ], there are one-way depen¬ 
dencies between the power system and the telecommunication network. This means, for instance, 
that some power lines must be restored before some parts of the telecommunication network become 
available. It is possible to use centralized mechanisms for restoring the system as a whole. However, 
it is often the case that these restorations are performed by different agencies with independent 
objectives and selfish behavior may have a strong impact on the social welfare. It is thus impor¬ 
tant to stndy whether it is possible to find high-quality outcomes in decentralized settings when 
stakeholders proceed independently and do not share complete information about their utilities. 

This paper aims at taking a first step in this direction by proposing a class of two players one¬ 
way dependent decision settings which abstracts some of the salient features of these applications 
and formalizes many of Hahnel and Sheeran’s critiques. We present a number of negative and 
positive results on one-way games. We first show that Nash equilibria in one-way games, under 
no side payments, can be arbitrarily far from the optimal social welfare. Moreover, in contrast to 
Coase theorem, we show that when side payments are allowed in a Bayes-Nash incentive-compatible 
setting, there is no ex-post efficient individually rational, and budget-balanced mechanism for one¬ 
way games. To address this negative result, we focus on mechanisms that are budget-balanced, 
individually rational, incentive-compatible that are relatively efficient. Our main positive result is 
a single-offer bargaining mechanism which under reasonable assumptions on the players, increases 
the social welfare compared to the setting where no side payments are allowed. We also show that 
this single-offer mechanism cannot be improved by a (randomized) multi-offer mechanism. 

The rest of this paper is organized as follows. In Section [2] we define one-way games and study 
the properties of Nash equilibria. In section ST] we prove an impossibility result. The single offer 
mechanism is presented and analyzed in sections [5] and 15.11 respectively. Finally, in section [6] we 
present a multi-offer mechanism and we show that it doesn’t improve the efficiency with respect to 
the single-offer one. 


2 One-Way Games 

One-way games feature two players A and B. Each player i ^ A, B has a public action set Si 
and we write S = Sa x Sb to denote the set of joint action profiles. As most commonly done in 
mechanism design, we model private information by associating each agent i with a payoff function 
Ui : S X Qi ^ M+, where Ui{s, 0i) is the agent utility for strategy profile s when the agent has type 
6i. We assume that the player types are stochastically independent and drawn from a distribution / 
that is common knowledge. We denote by 0j the possible types of player i and write 0 = 0^ x 0^. 
If 0 € 0, we use 0i to denote the type of player i in 0. Similar conventions are used for strategies, 
utilities, and type distributions. 

A key feature of one-way games is that the payoff ua{{sa, sb),0a) of player A is determined 
only by her own strategy and does not depend on H’s actions, i.e., 

ysA, SB, s'b, 0a ■ uaUsa, Sb), 0a) = ua{{sa, s'b),Ga)- 

As a result, for ease of notation, we use ua(sa,0a) to denote T’s payoff. Obviously, player B must 
act according to what player A chooses to do and we use sb(sa, 0b) to denote the best response of 
player B given that A plays action sa and player B has type 0b, i.e., 

sb{sa,0b) = arg-max UB{{sA, sb), 0b) 

sb&Sb 
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where ties are broken arbitrarily. In this paper, we always assume that ties are broken arbitrarily 
in arg-max expressions. 

One-way games assumes that players are risk-neutral agents and that after having observed the 
realization of their own types, players simultaneously choose their actions. As a consequence, if 
side payments are not allowed, player A will play an action that yields her a maximum payoff, 
i.e., 

Sa(^a) = arg-max ua(sa, 0a), 

Player B will pick s^{6b) such that her expected payoff is maximized, i.e., 

Sb{0b) = arg-maxEe^ [uB{sB{sA{0A),dB),dB)] ■ 
sb&Sb 

The set of Nash equilibria (NE) is thus characterized by (6) = {s^{9a), Sb{0b)) Q S. The 
best response s^{9b) of player B may be a bad outcome for her even when B has a much greater 
potential payoff. Player A achieves its optimal payoff, but our motivating applications aim at 
optimizing a global welfare function 


SW {{sA, sb), 9) = ua{sa, 9 a) + ub{{sa, sb), 9 b)- 

Thus, the global welfare achieved by the Nash equilibria can be expressed as SW{s^ (9), 9). We 
quantify the quality of the Nash equilibrium outcome with the price of anarchy (PoA). 

Definition 1. The price of anarehy of s^{9) C S given type 9 £ Q is defined as 


PoA{9) = 


maxse 5 SW (s, 9) 


min^g^iv( 0 ) SW{s,9)' 

A natural extension for the price of anarchy is to quantify the expected worst-case equilibrium. 
Definition 2. The Bayes-Nash price of anarchy is defined as 


PoA = Eg [PoA{9)]. 

Note that the price of anarchy given type 0 G 0 can be used to obtain a lower and an upper bound 
on the Bayes-Nash price of anarchy in the following way, 

mm.PoA{9) < PoA < max PoA{9). 

6 »ee 0 G 0 

Throughout this paper, we use the following two running examples to illustrate key concepts. 

Example 1. Consider the instance where player A has two possible actions s\,s\ G 5a- Action 
s\ has a payoff UAis\) distributed according to a uniform distribution between 0 and 100, while 
action s\ has a constant payoff ua{s\) = 100. The set of dominant actions for player B corresponds 
to the set of best responses sb(s\) and Let player B have only one type and we set payoffs 

to be ub{sb{sa)) = x and ub(sb('Sa)) ~ where x is a positive constant. When no transfers are 
allowed, player A will always play action s^, yielding a social welfare of 100 -|- 0. If player A plays 
her expected payoff is 50 and the expected social welfare is thus 50 -|- x. The price of anarchy 
is if X > 50 and 1 otherwise. Notice that the PoA is an increasing function of x. 
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Example 2. Consider the instance where player A has n possible actions E 

Each action has a payoff ua{s\) which is independent and identically distributed according to 
a Uniform distribution between 0 and 1. For player B, consider the set of best responses 
SBisA,0B),SBisA,(^B), - ■ ■ ,sb{sa,0b)- Let the expected payoffs be [ub(ss(s^,6»b),0b)] = m 
for z = 1, 2,... , n, where fii > fi 2 > • ■ ■ ^ fJ-n- All other payoffs are set to 0, i.e., for any 6b & Qb, 
E Sa and sb / sb(sa, Ob)- ub {{sb, sa), 6b) = 0. 

When no transfers are allowed, player A chooses her (realized) maximizing payoff action. Such 
action is distributed according to the largest order statistic, i.e., the maximum between all payoffs. 
The largest order statistic between n standard Uniforms follows a Beta{n, 1) distribution, with 
mean Hence, the expected payoff of player A is 

By symmetry of player H’s payoff functions, all her actions will be played with probability L, 
Thus, player H’s expected payoff is maximized when choosing sb('sJ^, 6b), yielding her an expected 
payoff of As a result, the expected social welfare in equilibrium is 

To compute the optimal social welfare note that if player A select with probability 1, her 
expected payoff is ^ and the expected payoff of player B is m and so the social welfare is ^ + /ri, 

which is greater than ^ if /^i > The price of anarchy is + /zi) / and in 

the limit as n ^ oo, the PoA becomes ^ + /zi. Notice that the PoA is an increasing function of /zi. 

Examples [T] and [2] illustrates that, when no transfers are allowed, the price of anarchy can be 
arbitrarily large as player B ’s payoff is large compare with the payoff of player A. We now 
generalize this idea to quantify the price of anarchy in one-way games. 


Proposition 1. In one-way games, the price of anarchy when no payments are allowed satisfies, 
for any type 6, 

_ maxsgB ub{s,6b) _^ PoA{6) <1-1 ub{s,6b) 

maXs£SUAis,6A)+UB{s’^{d),6B) ~ ~ max^gs0yi) ’ 


Proof. Let Ui{6i) = max^gs zzi(s, 0j), i E {A,B}. Independence of player A implies that, for all 
0 E 0, her payoff is ua{s^{6),6a) = ua{Ga)- It follows that 


max{u^(0A),zzB(0B)} 
ua{0a) + ub{s^{0),6b) 
^ ua{0a) + ub{0b) 

~ ua{6a) + ub{s^{6),6b) 


< PoA{e) 

^ ua{0a) + ub{Gb) 
~ ua{6a) 


^ ub{0b) 
ua{0a)' 


□ 


The price of anarchy can thus be arbitrarily large. When it is large enough. Proposition [T] indicates 
that maXsgB ub(s, 0 b) > niax^gB zza(S) 0 a) > ^^s(s^(0), 0 b)- In this case, player B has bargaining 
power to incentivize player A monetarily so that she moves from her equilibrium and cooperates to 
overcome a bad social welfare. This paper explores this possibility by analyzing the social welfare 
when side payments are allowed. 


3 Related Work 

Before moving to the main results, it is useful to discuss related games. One-way games may seem to 
resemble Stackelberg games with their notions of leader and follower. The key difference however is 
that, in one-way games, the leader does not depend on the action taken by the follower. In addition. 


4 








in one-way games , players do not have complete information and moves are simultaneous. Jackson 
and Wilkie (120051 ) studied one-way instances derived from their more general framework of endoge¬ 
nous games. However, they tackled the problem from a different perspective and assumed complete 
information (i.e., the player utilities are not private). Jackson and Wilkie gave a characterization 
of the outcome when players make binding offers of side payments, deriving the conditions under 
which a new outcome becomes a Nash equilibrium or remains one. They analyzed a subclass, called 
’one sided externality’, which is essentially a one-way game but with complete information. They 
showed that the efficient outcome is an equilibrium in this setting, supporting Coase’s claim that 
a polluter and his victim can reach an efficient outcome. Under perfect information, the victim 


can determine the minimal transfer necessar y to support the efficient pla y, 
does not hold under incomplete information Mverson and Satterthwaite 


Naturally, this result 
1983]. In what follows. 


we design a bargaining mechanism that is able to cope with the incomplete information setting. 


4 Bayesian-Nash Mechanisms 

In this section, we consider a Bayesian-Nash setting with quasi-linear preferences. Both players A 
and B have private utilities and beliefs about the utilities of the other players. By the revelation 
principle, we can restrict our attention to direct mechanisms which implement a social choice 
function. A social choice function in quasi-linear environments takes the form of f{9) = {k{6),ti9)) 
where, for every 0 G 0, k{9) G 5 is the allocation function and ti{9) G M represents a monetary 
transfer to agent i. The main objective of mechanism design is to implement a social choice function 
that achieves near efficient allocations while respecting some desirable properties. For completeness, 
we specify these key properties. 

Definition 3. A social choice function is ex-post effieient if, for all 9 G Q, we have k{9) G 
arg-maxgg5 Ei Ui{s,9). 

Definition 4. A social choice function is budget-balanced (BB) if, for all 9 G Q, we have ~ 

0 . 


In other words, there are no net transfers out of the system or into the system. Taken together, 
ex-post efficiency and budget-balance imply Pareto optimality. An essential condition of any mech¬ 
anism is to guarantee that agents report their true types. The following property captures this 
notion when agents have prior beliefs on the types of other agents. 


Definition 5. A social choice function is Bayes-Nash incentive compatible (IC) if for every player 
i: 


d-i),9i) -I- ti{9i, 9_i)] > Ui{k{9i, 9i), 9i) + ti{9i, 9_i) 


where 9i G 0 * is the type of player i, 9i is the type player i reports, and denotes player i’s 

expectation over prior beliefs 9-i of the types of other agents given her own type 9i. 


The most natural definition of individual-rationality (IR) is interim IR, which states that every 
agent type has non-negative expected gains from participation. 


Definition 6. A social choice function is interim individual-rational if, for all types 9 G &, it 
satisfies 

^9-i\9i [ui{k{9),9i) -G ti{9)] > Ui{9i), 
where Ui{9i) is the expected utility for non-participation. 
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In the context of one-way games, both players have positive outside options that depend only 
in their types. In particular, the outside options are given by the Nash equilibrium outcome 
under no side payments. For players A and B, the expected utilities for non-participation are 
ua{0a) = ua{sa{Sa),0a) and ub{0b) = ub{s^{0),9b) respectively. 


4.1 Impossibility Result 


This section shows that there exists no mechanism for one-way games that is efficient and s atisfie s 
the traditional desirable properties. The result is derived from the Myerson-Satterthwaite ( 19831 ) 
theorem, a seminal impossibility result in mechanism design. The Myerson-Satterthwaite theorem 
considers a bargaining game with two-sided private information and it states that, for a bilateral 
trade setting, there exists no Bayes-Nash incentive-compatible mechanism that is budget balanced, 
ex-post efficient, and gives every agent type non-negative expected gains from participation (i.e., 
ex interim individual rationality). 

Our contribution is twofold: we present an impossibility result for one-way games and we relate 
them with bargaining games, an idea that we will further explore on the following sections. We 
now formalize the impossibility result for one-way games. 

Consider the Myerson-Satterthwaite bilateral bargaining setting. 


Definition 7. Myerson-Satterthwaite bargaining game: 

1. A seller (player 1) owns an object for which her valuation is vi G Vj, and a buyer (player 2) 
wants to buy the object at a valuation U 2 G V 2 . 

2. Each player i knows her valuation Vi at the time of the bargaining and player 1 (resp. 2) has 
a probability density distribution f 2 {v 2 ) (resp. fi{vi)) for the other player’s valuation. 

3. Both distributions are assumed to be continuous and positive on their domain, and the inter¬ 
section of the domains is not empty. 


By the revelation principle, we can restrict our attention to incentive-compatible direct mecha¬ 
nisms. A direct mechanism for bargaining games is characterized by two functions: (1) a probability 
distribution cj : Ti x V 2 —)• [0,1] that specifies the probability that the object is transferred from the 
seller to the buyer and (2) a monetary transfer scheme p : Vi x V 2 ^ ^. In this setting, ex-post 
efficiency is achieved if cr(vi,V 2 ) = 1 when vi < V 2 , and 0 otherwise. 

Our result consists in showing that a mechanism Ai' for the Myerson-Satterthwaite setting can 
be constructed using a mechanism Ai for a one-way game in such a way that, if Ai is efficient, 
individual-rational (IR), incentive compatible (IC), and budget-balanced (BB), then A4' is efficient, 
IR, IC, and BB. The Myerson-Satterthwaite impossibility theorem states that such a mechanism 
Ai' cannot exist, which implies the following impossibility result for one-way games. 


Theorem 1. There is no ex-post efficient, individually rational, incentive-compatible, and budget- 
balanced mechanism for one-way games. 


Proof. For any bargaining setting, consider the following transformation into a one-way game in¬ 
stance: 


Sa = {5)4, si}, 5 b = {sb}, 

Vui G Vi : ua{sa,vi) = vi, UAisAAi) = 0, 

VU2 G V 2 : Ub{{sa,Sb),V2) = 0, Ub{{s\,Sb),V2) = V2, 
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where player types (^ 1 ,^ 2 ) G Vl x V 2 are drawn from distribution /i x f 2 . Two possible outcomes 
may occur, or {s\,sb), with social welfare vi and V2 respectively. 

Let us assume Ai = {k,t) is a direct mechanism for one-way games and that Ai is ex-post 
efficient, IR, IC, and BB. We now construct a mechanism A4' = (cr,p), where a{vi,V 2 ) is the 
probability that the object is transferred from the seller to the buyer and p{vi,V 2 ) is the payment 
of each player. We define Ai' such that 


a(vi,V2) 


0 k{vi,V2) = {s\,sb), 
1 if k{vi,V2) = {s%sb), 


and 

P{VI,V2) = t{vi,V2). 


It remains to show that Ai' satisfies all the desired properties. An ex-post efficient mechanism 
Ai in the one-way instance satisfies 


k{vi,V2) 


(s\,sb) ifvi>V2, 
{s\,sb) iivi<V2. 


Therefore, a{vi,V 2 ) will assign the object to the buyer iff vi < V 2 - That is, the player with the 
highest valuation will always get the object, meeting the restriction of ex-post efficiency. The 
budget-balanced constraint in Ai implies that pi{vi,V 2 ) + P 2 {vi,V 2 ) = 0 for all possible valuations, 
so Ai' is budget-balanced. 

The individual rationality property for Ai' comes from noticing that the default strategy of 
player A when no payments are allowed is and the corresponding payoff is vi. Therefore, the 
seller utility is guaranteed to be at least her valuation vi. Analogously, the buyer will not have a 
negative utility given that ub{{s\, sb),V 2 ) = 0. 

Incentive-compatibility is straightforward from definition. Assume that Ai' is not incentive- 
compatible, then in mechanism Ai, at least one player could benefit from reporting a false type. 

Such a mechanism Ai' cannot exist since it contradicts Myerson-Satterthwaite impossibility 
result, which concludes our proof. □ 


An immediate consequence of this result is that Bayesian-Nash mechanisms can only achieve 
at most two of the three properti es: ex-post efficiency, individual-rationality, and bud get balance. 
For instance, VCG and dAGVA d’Aspremont and Gerard-Varet . I 1979 I : lArrowl. Il979l | are part of 
the Groves family of mechanisms that truthfully implement social choice functions that are ex-post 
efficient. VGG has no guarantee of budget balance, while dAGVA is not guaranteed to meet the 
i ndivi dual-rationality constraints. We refer the reader to Williams ( 1999l i and Krishna and Perry 
( 19981 ) for alternative derivations of the impossibility result for bilateral trading under the Groves 
family of mechanisms. 


5 Single-Offer Mechanism 


In this section, we propose a simple bargaining mechanism for player B to increase her payoff. The 
literature ab out b argaining games is extensive and we refer readers to a broad review by Kennan 
and Wilson ( 1993 1. 

Given the nature of our applications, individual rationality imposes a necessary constraint. Oth¬ 
erwise, player A can always defect from participating in the mechanism and achieve her maximal 
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payoff independently of the type of player B. Additionally, we search for Bayesian-Nash mecha¬ 
nisms without subsidies, i.e., budget-balanced mechanisms. The lack of a subsidiary in this case 
gives rise to a decentralized mechanism that does not require a third agent to perform the compu¬ 
tations needed by the mechanism. However, a third party is needed to ensure compliance with the 
agreement reached by both players. 

An interesting starting point for one-way games is the recognition that, whenever player B has 
a better payoff than A, player A may let player B play her optimal strategy in exchange for money. 
The resulting outcome can be viewed as swapping the roles of both players, i.e., player B chooses 
her optimal strategy and A plays her best response to B’s strategy. In this case, as in Proposition 
m the worst outcome would be 

^ maXsGS ua{s,9a) 
max^g^ UBis,9B)' 

This observation leads to the following lemma. 

Lemma 2. Consider the social choice function that selects the best strategy that maximizes the 
payoff of either player A or player B, i.e., the strategy 


s' (6) = arg-max (max {uAis, 9a),ub{s, 9b))) ■ 

sgiS 

In the one-way game, strategy s'{9) has a price of anarchy of 2 (i.e., \/9 PoA{9) = 2). 

Unfortunately, this social choice function cannot be implemented in dominant strategies without 
violating individual rationality. Player A may have a smaller payoff by following strategy s' instead 
of the Nash equilibrium strategy s^. Indeed, when SW{s', 9) < SW{s^, 9), it must be that at least 
one of the players will be worse than playing the Nash equilibrium strategy s^. Lemma [2] however 
gives us hope for designing a budget-balanced mechanism that has a constant price of anarchy. 
Indeed, a simple and distributed implementation would ask player B to propose an action to be 
implemented and player A would receive a monetary compensation for deviating from her maximal 
strategy. 

We now present such a distributed implementation based on a bargaining mechanism. The 
mechanism is inspired by the model of two -person bargaining under incomplete information pre¬ 
sented by Chatterjee and Samuelson ( 198,11 ). In their model, both the seller and the buyer submit 
sealed offers and a trade occurs if there is a gap in the bids. The price is then set to be a convex 
combination of the bids. Our single-offer mechanism adapts this idea to one-way games. In par¬ 
ticular, to counteract player A’s advantage, player B makes the hrst and hnal offer. Moreover, the 
structure of our mechanism makes it possible to quantify the price of anarchy and provide quality 
guarantee on the mechanism outcome. Our single-offer mechanism is defined as follows: 


1. Player B selects an action G Sa to propose to player A. 

2. Player B also computes her outside option s'^{sa, 9b) in case player A rejects action sa, and 
we denote by u^{sa, 9b) the expected payoff from her outside option. 

3. Player B proposes a monetary value of 7 -Ab(sa, 9b) with As(sa, 9b) = ub{sb{sa, 9b), 9b) — 
u^{sA, 9b) and 7 G Kjoq] to player A in the hope that she accepts to play strategy sa instead 
of strategy s^. 

4. Player A decides whether to accept the offer. 

5. If player A accepts the offer, the outcome of the game is {sa, sb{sa,9b))- Otherwise the 
outcome of the game is the outside option (s^( 6 *a), SBisA,9B))- 






It is worth observing that a broker is required in this mechanism to ensure that the outcome 
s^{sa,0b)) is implemented if player A rejects the unique offer, and no counteroffers are 
made. A key feature of the single-offer mechanism is that it requires a minimum amount of infor¬ 
mation from player A (i.e., whether she accepts or rejects the offer). 

To derive the equilibrium strategy for the single-offer mechanism, we assume players are ex¬ 
pected utility maximizers. The parameter 7 € I^[o,i] has been chosen so that player B, satisfying 
individual rationality, never offers more than Ab{sa,(^b) and her payoff is never worse than her 
expected outside option u^{sa,0b)- Whereas the mechanism can only guarantee interim individ¬ 
ual rationality for player B, it provides ex-post individual rationality for player A, as shown in the 
following proposition. 

Proposition 2. If players A and B play the single-offer mechanism, for any (0 a, &b) £ player 
A accepts the offer ( 7 , sa) whenever 

ua(sa,0a) + 7 - ^b{sa,0b) > ua{sa(0a),0a)- 

In case player A rejects the offer ( 7 , 5 ^), she will choose her utility maximizing action s^(9a) 
as her outside option. Note that by Proposition [2l if 5^4 = s^(6a), player A would never reject 
the proposed action 5 ^ 4 . Accordingly, if proposed action 5^4 is rejected then 5^4 7 ^ s^(6a)- This 
observation leads to the following proposition. 

Proposition 3. For every task s G 5^4, let 0^ = Qa \ {0A £ : s = arg-max 3 ,g 5 ^ ua(x,9a)}- 

In the single-offer mechanism, player B will pick outside option s^{sa,9b) such that her expected 
payoff is maximized, i. e., 

s%{sa,9b) = arg-maxEg=A [ub ((s^(6'a), sb), 6*5)] • 

Example 1. (continued) The payoff of Player B is higher if action is played by player A. Hence, 
player B has incentives to submit an offer c that triggers action Player A accepts the offer 
if c-|- ua(sa) > ua(sa) = 100. Given that ua(sa) follows a uniform distribution, the probability 
that player A accepts the offer is ^ if c < 100 and 1 otherwise. For player B, this offer has an 
expected payoff of ^ • (a: — c) if c < 100 and x — c otherwise. The optimal value for the offer is 
given by c* = I if X < 200 and c* = 100 if x > 200. This leads to an expected social welfare for 
the single-offer mechanism of 

[50+x itx>200 

Recall that the optimal social welfare is 50 -|- x if x > 50 and 100 otherwise. Therefore, the 
mechanism has a price of anarchy, 


PoA 


' 100 

100-1-a:(^ — i) 
_ 50-l-x 

100 - 1 -— 
50+a; 

' 50+a: 


if X < 50, 
if 50 < X < 200, 
if 200 < X . 


The PoA is bounded by a constant and in fact, PoA < 1.21 for any x. This contrasts with the 
unbounded PoA obtained when no side payments are allowed. 
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Example 2. (continued) The payoff of player B is higher if action is played by player A. Hence, 
player B has an incentive to submit a monetary offer c < 1 that triggers action Player A accepts 
the offer if c + ua{s\) > max^^g^^ UA(sy 4 )- It can be shown that the probability that player A 
accepts the offer is In case of acceptance, the expected payoff is fii — c for player B and 

^ + c for player A. In case of rejection, it is guaranteed that player A will not play and hence 
player H’s outside option is action sb{s\,9b) with an expected payoff of Player A’s expected 
outside option is , corresponding to her expected maximum payoff derived in Example [2j As a 
result, player B by offering c, has an expected payoff of 


ere — 
re — 1 




c) + 



M2 

re + 1 ’ 


When n is large, the probability of acceptance is approximately, 

ere — c”' 

hm -= c. 

n—>oo re — 1 


Accordingly, in case of rejection, the expected payoffs become lim„_).oo = 0 for player B, and 
lim„_>.oo = 1 for player A. Player H’s expected payoff is thus c (/ii — c) and is maximized when 
she offers c* = ^ ii fii < 2 and c* = 1 otherwise. This leads to an expected social welfare for the 
single-offer mechanism of SW = + ^) -|- (1 — c*)(0-|- 1). Recall that the optimal social welfare 

is ^ if ^ and 1 otherwise. Therefore the mechanism has the following price of anarchy. 


PoA 


1 

f(/ri + i) + (l-f) 
_^ + Ml_ 

f(^i + i) + (l-f) 

^ +Mi ^ ^ 

. ^ + Mi 


if < i, 

if ^ < Ml < 2, 
if 2 < ;Ui, 


and the PoA has a maximum value of ^ (3 -|- 2^/To) ~ 1.203. This contrasts with the unbounded 
PoA obtained by the Nash equilibrium when no side payments are allowed. 


We now generalize the analysis done in Examples [T] and [ 2 j We proceed by studying the utility- 
maximizing strategy (s^i, 7 ) for player B and then derive the expected social welfare of the outcome 
for the single-offer mechanism. Note that, in case of agreement, the action of player B of type 6b 
is solely defined by sa as she has no incentives to defect from its best response sb{sa,6b)- By 
Proposition [21 player A accepts an offer whenever Aa{sa,6a) < '^Ab{sa-,6b)-, where Aa{sa-,6a) = 
— ua{sa-, Sa)- Player B obviously aims at choosing 7 and to maximize her payoff 
and we now study this optimization problem. In the case of an agreement, player B is left with a 
profit of 

ub{sb{sa, 6b), Ob) — 7 • ^b{sa, Ob)- 
Otherwise, player B gets an expected payoff of u^{sa,6b)- 

Definition 8. The probability that player A accepts the offer {sa,i), given that player B has type 
6b G &b, is 


P{sA, 1, 6b) = Pr [7 • Ab{sa, 6b) > ^a{sa, 6a)] 
with 

6{sa,x,6a) = < ^ 


’ Oa^Qa 


fA{6A) ■ <I(SA, 7 • ^b{sa, 6B),6A)d6A, 


ifx> Aa{sa,6a) 
otherwise. 
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The expected profit of players A and B for proposed action s = (sa, sb) and 7 when player B has 
type Ob is given by 

^eA[UB{sA,'y,OB)] = + -P(s^,7, 0b) ((1 - 7) • Ab(s^,0b)) , 

[Ua{sa, 7 ^(^b)] = ^eAb^Ai^A)] + P{sa,J, 0 b) ■ (7 ‘ ^b{sa, 0 b) - [Ayi(s^,0yi)]) . 

The optimal strategy of player B is specified in the following lemma. 

Lemma 3. On the single-offer mechanism, player B chooses s\{9b) and 'y*{s\,9B) such that 

s*a{0b) = arg-maxEe^ [Ub{sa,1*,0b)] , 

where 

'f*{sA,0B) = arg-maxP(s^,7,0B) • (1 - 7). 

7GR[0,i] 


5.1 Price of Anarchy 

We now analyze the quality of the outcomes in the single-offer mechanism. The first step is the 
derivation of a lower bound for the expected social welfare of the single-offer mechanism. Inspired 
by Lemma [21 instead of considering all pairs ( 5 ^, 7 ), the analysis restricts attention to a single 
action = arg-max^^g^^ ub(sb(s/i, 0b)) ^b)- We prove that, when offering to player A action 
and its associated optimal value for 7 , the expected social welfare is lower than the optimal 
pair (s^, 7 *). As a result, we obtain an upper bound to the price of anarchy of the single-offer 
mechanism. 

To make the discussion precise, consider the strategy where player B offers (s^, 7 *(s'^, 0 b)), 
with 7 *(s^, 0 b) being the optimal choice of 7 given s^, following the notation used in Lemma[3l 

Lemma 4. For any type 9 b £ ©b of player B, the expected social welfare achieved by the single¬ 
offer mechanism is at least the expected social welfare achieved by the strategy (s) 4 , 7 *(s'^, 0 b)). 

Proof. Let 7 * = ri^A^OB) and 7 ' = 7*(s'^,0b). The optimality condition of s* implies that 

[UB{s',ff,0B)] < Ee^ [UB{s*,ff,0B )]. (1) 

Two cases can occur. The first case is 

p{s'^,j',eB)<p{s%r,OB), 

i.e., the probability of player A accepting offer ( 5 ^, 7 *) is greater than if offered ( 5 ^, 7 '). Then, it 
must be that the expected payoff of player A is greater when offered ( 5 ^, 7 *), i.e., 

E,^ [Ua{sU,7',0b)] < Ee^ [UAis*A,7*,0B)] • 

This, together with Inequality ([T]) results in the single-offer mechanism having a greater expected 
social welfare. 

The second case is 

P{s'a,7',9b)> P{s*A,r,0B). 

Consider 7 " such that P{ s'^,i',0B) = P{s%ff,0B). The fact that the probabilities of accep¬ 
tance are the same implies that the expected payoff of Player A is the same in both cases, i.e., 
[[/a(s'^,7",0b)] = Ee^ [CAi(s^,7*,0b)]. This, together with Equation ([I]) yields 

EeASW{s*,ff,OB)]>EeASW{s',ff',OB)]. 
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This is equivalent to 

ub{s*,0b)+'K e^[uA{s%0A)] > (2) 

Similarly, consider 7** such that 


p{s'A,y,eB) = P{s%^**,eB), 


which implies 

[Ua{s'a, 7',0 b)] =E,^ [UA{s*A,-f**,eB)]- 

Existence of 7** is guaranteed by Inequality ([ 2 ]) which states that, there is more money in expec¬ 
tation to transfer to player A when choosing s* over s'. The fact that the acceptance probabilities 
are the same, together with Inequality ([ 2 ]), implies that 

EeASWis*,j**,eB)]>l^eASW{s',^',eB)]. 


Given that the expected payoff of player A is the same in both cases, it must be the case that the 
expected payoff of player B is higher when using (5^,7**). 

Therefore, we have found an offer for the single-offer mechanism with greater expected social 
welfare and a greater payoff for player B compared with strategy (5^,7'). □ 

We are ready to derive an upper bound for the induced price of anarchy for the single-offer mecha¬ 
nism. We first derive the price of anarchy of strategy (s^, 7') in case of agreement and disagreement 
of player A. 

Lemma 5 . Consider action s' = argmax^g^ ub(s, 0 ^) and let PoA^('j) and PoA^{j) denote the 
indueed price of anarchy if player A aceepts and rejects the offer given a proposed 7. Then, 


PoA^{'-^) = 1-1-7 and PoA^{'y) 



7 


Proof Let = s^{0a), = ua{sa,0a), u'a = ua{s',9a), u'b = ub{s',6b), = s^{s'^,0a) and 

'“b ~ '^Bi^A’^s)- Player B offers action and a monetary value of 7As(s^^) = 'yiu'^ — u^) to 
player A. Two cases can occur. 


Player A accepts: -|-7As(s^) > u^. Strategy is played. 


PoA^ < 


Ua + u'b ^ u'a + u'b + 1- u'b 

u'a + u'b ~ u'^ + u'j^ 


1 -I- 7- 


u 


B 


+ u'. 


B 


<1 + 7 - 


Player A rejects: -|-7Ab(s'^) < u^. Strategy {sa-,^^) is played. 


PoA^ < 


U4 -I- u'f 


B 


u^ + uO 


< 1 + 


u 


B 


u^+uO 


1 

< 1 H—) 
7 


where the last inequality comes from 

u'a + 'jAb{s'a) <ua "ju's <ua + 'yu^ - u'a <ua Pu^. 
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□ 


When 7 = 1, the price of anarchy is 2 but player B has no incentive to choose such a value. 
If 7 = 0.5, the price of anarchy is 3. Of course, player B will choose 7 ' = Lemma 

[5] indicates that the worst-case outcome is (1 -|- 7 ') when player A accepts with a probability 
,9b) and (1 -|- y) otherwise. This yields the following result. 

Theorem 6. The Bayesian price of anarehy of the single-offer mechanism for one-way games is 
at most 

(1 - 7 ', dB){l - 7 ')) > 

where 

7 ' = arg-maxP(s'^,7,6's)(l - 7 )- 

7Gl*[0.l] 

Proof. By combining Lemmas 0] and O we can derive the following upper bound for the PoA. 


PoA < 


Pis'^,^',9B)PoA^{y) + (1 - P{s'^,y,eB)) PoA^ii) 
7 ^ Gb ) (1 + 7O + (1 “ PWa ^ 7 ^ ^s)) + y ^ 


y + 1 

7 ' 

y + 1 

7 ' 


+ P{s'a^ 7 ^ ^b) ^ 7 ^ - y ^ 
+ pWaAM 


□ 


To get a better idea of how the mechanism improves the social welfare, it is useful to quantify the 
price of anarchy in Theorem [ 6 ] for a specific class of distributions. 

Corollary 7. If Aa{s'j^,9a) has a cumulative distribution funetion F {x) = {x/Ab)^ between!) and 
Ab, with 0 < /3 < 1, then 7 = and the price of anarchy is at most 

(2 + i)(l-/3/'(l + /3)-(^+i)). 

For example, if j5 = 1, then F{x) is the uniform distribution, 1 = \, and the expected priee of 
anarchy is at most 2.25. 

This corollary, in conjunction with Lemma O gives us the cost of enforcing individual rationality, 
moving from a price of anarchy of 2 to a price of 2.25 in the case of a uniform distribution. 

The strategy {s'^^,^') is of independent interest. It indicates how a player with limited compu¬ 
tational power can achieve an outcome that satisfies individual rationality without optimizing over 
all strategies. 


13 






6 Multi-Offer Mechanism 


This section extends the single-offer mechanism by allowing player B to make multiple monetary 
offers for the same proposed action. Our main result shows that making counteroffers under com¬ 
mitment does not improve efficiency over the single-offer mechanism. By commitment we mean 
that player B must be able to guarantee that the price schedule she originally announces will not 
be modified in the future. 

The single-offer mechanism was characterized by a an action sa_ G 5 ^ and a single value 7 G 
M[07]. The multi-offer mechanism is characterized by a 4 -tuple 

{sA eSA,n,j= (71,...,7„) = (pi,...,p„) GM|J)^;^]), 

where n is the number of offers, (71,..., 7^) is a sequence of numbers in ^[0,1] to compute the ratios 

of Ab{sa, 0 b) = ub{sb{sa,Gb),Sb) — u^{sa, 0 b) to be offered, and (pi,...,p„) is a sequence of 
probabilities for continuing to make offers where we assume that pi = 1 . The multi-offer mechanism 
is defined as follows: 

1. Player B selects an action G Sa to propose to player A. 

2. Player B also computes her outside option s^{sa, Ob) in case player A rejects action sa, and 
we denote by u^{sa, Ob) the expected payoff from her outside option. 

3 . Player B selects 7 = (71,..., 7^) G and p = (pi,... ,Pn) G KjJj , with pi = 1. Player B 

has to commit to this sequence of values (in spite of what she learns from player yl’s actions). 

4 . At step 1 < f < n, player B proposes a monetary value of 7* • Ab{sa, Ob) with Ab{sa, Ob) = 
ub{sb{sa, 0 b), 0 b) — u%{sa, 0 b) and 7* G K[o,i] to player A in the hope that she accepts to 
play strategy sa instead of strategy s^. 

5 . Player A decides whether to accept the offer. 

6. If player A accepts the offer, the outcome of the game is {sa, sb{sa, 0 b))- 

7. If player A rejects the offer, set f •(— i -|- 1 and go to step 4 with probability p*. 

8. Otherwise the outcome of the game is the outside option {s^{ 0 a), (sa, 0 b))- 

For ease of notation, we denote Ab(sa) = Ab{sa,0b) and Aa{sa) = Aa{sa,0a) for the rest 
of this section, where Aa{sa, Oa) = ua{s^{0a), Oa) — ua{sa, Oa)- In the multiple-offer mechanism, 
player B makes a sequence of offers ^iAB{sA) to player A to play strategy sa- The first offer 
is 7 iAb(sa)- If player A refuses the offer, then player B makes a second offer 72Ab(sa) with 
probability p 2 - Hence, with probability 1 — p 2 ; player B makes no offer and the outcome of the 
game is (^s^{0a), s^{sa,0b)) ■ In general, at iteration i, player B makes an offer ^iAB{sA) with 
probability pi and the outside option is played with probability 1 —pi- The mechanism stops when 
player A accepts an offer or when Player B stops making offers to player A. In this last case, once 
again, the outside option is played. 

Observe that player A could reject an offer even if it is more profitable than playing her max¬ 
imizing utility action s^{0a) because she may expect a better offer in the future. To avoid this 
behavior, the multi-offer mechanism imposes a condition on the 7 j’s and p^’s to ensure that player A 
accepts the first offer that gives her a higher payoff than her default action s^{0a)- Two conditions 
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must hold for player A to accept an offer in step i G [1,... , n]: 
faj Individual Rationality: 


li^B{sA) > Ayi(sA), (3) 

which is equivalent to Proposition [2j 

(b) Greater expected utility in step i than in step i + 1: 

7iAs(sA) + ua{sa, Oa) > Pi+i (7i+iAB(sA) + ua{sa, Oa)) + (1 - Pi+i)uA{sA {^a), Oa) 

which is equivalent to 


7 . ( 4 ) 

1 - Pi+l 


We now show that the multiple-offer mechanism is in fact equivalent to the single-offer mechanism. 
We use the notation 


'O i = 0, 


Si={ 


li Vi+iii+i n> i> 0, 
i-Pi+i ’ 

k7n i = n. 


so that Condition (jH) can be expressed as SiA^isA) > 

Note that if player A refuses an offer with 7 *, she will also refuse offers with smaller ratios. This 
observation leads to the following proposition. 


Proposition 4. In the multi-offer mechanism, 


7i+i > 7i, Vi G [1,... ,n - 1]. 

Therefore, Proposition 0] states that counteroffers should be increasing on time. 
Lemma 8. In the multiple-offer mechanism, for all i G [1,... ,n], 


7i > Si. 


Proof. Assume that 7 * < S^. By definition of Si, it follows that 7 * — Pi+iji < Ji — Pi+iji+i and 
hence 7 ^ > 7 i+i. This contradicts Proposition 01 stating that the 7 ’s are defined as a non-decreasing 
sequence. □ 

Corollary 9. Condition & implies Condition 

Proof Si < 7 i and Aa(sa) < SiAsisA) implies Aa(sa) < SiAsisA) < liAB{sA)- □ 

If player A rejected the offer in step f — 1, then Conditions ([3]) and ([4]) were both not satisfied in 
step i — 1. By Lemma [HI only two cases may occur: 

1. If 7 iAs(sA) < Aa(sa) then it must be the case that 

Si-iAsisA) < -ii-iAB{sA) < Aa{sa)- ( 5 ) 

2 . If jiAB{sA) > Aa{sa) then 

Si-iABisA) < Aa(sa) < 'ji-iABisA)- (6) 
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The disjunction of Conditions ([5]) and ([6]) yields the following inequality 




( 7 ) 


By Corollary m if player A accepts in step i given that she rejected in step i — 1, we have that 


Aa(sa) < SiAsisA)- (8) 

Recalling Definition[8l the cumulative distribution function of random variable A^(sa) was denoted 

by, 

P{sa,7) = Pr < 7 Ab(sa)] • 

Hence the probability of acceptance in step i can be derived from Conditions ([7|) and ([8]) as 
Fi[Si-iAB{sA) < Aa(sa) < S'iAB(sA)] =P{sA,Si) - P{sA,Si-i). 


Player B aims at choosing the y^’s, the probabilities pi's and action to maximize her expected 
utility, which is equivalent to the following optimization problem. 


n 

max > 
1=1 



{P{sA,Si)-P{sA,Si.i))){l-Ji) 


s.t. Pi = 1, 

So = 0, 


Q _ li Pi+lli+l 
1 - Pi+i 

Sn — 7n, 

Sl<S2<...<Sn<l. 


(9) 

( 10 ) 

( 11 ) 

( 12 ) 

(13) 

(14) 


Where the term lXj=iPj is the probability of reaching to the z-th offer. We are now ready to state 
the main result of this section. 


Theorem 10. The multi-offer mechanism is equivalent to the single-offer mechanism in one-way 
games. 

Proof. By Equation (fT^ . 


(1 -pi+i){l - Si) = (1 - 7i) -Pi+iil - 7i+i)- 
Then, by using (fT^ and grouping the P{.SA,Si) terms, the objective function becomes 


n—1 

i=l 

Observe that each term in the objective function features an expression of the form P{sa,x ){1 — x). 
Hence the objective is bounded by above by 



1 (1-Pi+i)P’('SA,5'i)(l - ^i) 




+ 


YlPj R(sA,7n)(l -7n)- 


(15) 




n—1 

m<Yl 

i=l 


() (1 - Pi+i) -c +c-fipj 

\i=i / J i=i 
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where C = maxa; P(s^, x)(l — x). We show that, for any given probabilities p, there is a unique 
solution that meets this upper bound. Let x* = arg max^, P(sa, x). The right-hand term in (I15p 
is optimized by setting 7 ^ = x*. We show by induction that all the other terms are optimized by 
setting 7 j = x*. Assume that this holds for 7 i+i, ..., 7n. We need to optimize P{sa, Si){l — Si). 
By induction, 

o 7i - Pi+ix* 

‘Ji — 

1 - Pi+1 

and assigning x* to 7 ^ gives Si = x* and P{sa, Si){l — Si) = C. Since all 7 * are equal, this concludes 
the proof. □ 


The above derivation is related to a well-known result from Sobel and Takahashi ( 1983l i. which 
models an iterative bargaining where there is a buyer with a private reservation price and a seller 
with reservation price 0 who makes all the offers. There is a known fixed discount factor for each 
player and, when these discount factors are equal (this is equivalent to have a probability for a 
next offer), they showed that, under commitment, the infinite horizon bargaining is equivalent to 
the single shot. There are differences between their model and ours: In our model, the buyer is 
making the offers, the probabilities are not fixed a priori (Player B can choose them), and both 
outside options are private. 


7 Discussion 


In one-way games, the utility of one player does not depend on the decisions of the other player. We 
showed that, in this setting, the outcome of a Nash equilibrium can be arbitrarily far from the social 
welfare solution. We also proved that it is impossible to design a Bayes-Nash incentive-compatible 
mechanism for one-way games that is budget-balanced, individually rational, and efficient. To 
alleviate these negative results, we proposed two privacy-preserving mechanisms: a single-offer and 
a multi-offer mechanism and showed that both are equivalent. 

The single-offer mechanism is simple for both parties, as well as for the broker who just makes 
sure that the players follow the protocol. This mechanism also requires minimal information from 
the agents who perform all the combinatorial computations, while it incentivizes them to cooperate 
towards the social welfare in a distributed setting. Moreover, the mechanism has the following 
desirable properties: it is budget-balanced and satisfies the individual rationality constraints and 
Bayesian incentive-compatibility conditions. Additionally, we showed that, in a realistic setting, 
where agents have limited computational resources, a simpler version of the mechanism can be 
implemented without overly deteriorating the social welfare. 

It is an open question whether there exists another mechanism (possibly more complex) that 
could lead to a better efficiency, while keeping the above properties. Indeed, in one-way games, 
player A has a intrinsic advantage over player B, which is not easy to overcome. One possible 
promising mechanism consists of player B setting rewards for all player A’s actions, and player 
A choosing one in return for that mon ey. _ This is known as the Bayesian Unit-demand Item- 


Prici ng Problem ( BUPP ) Chawla et ali . 120071 1 . Recent work has shown this problem to be NP- 
hard Chen et ali I2ni4l]. but a f actor 3 approximation to the optimal e xpected reven u e of p layer 


B is obtained in [Chawla et all l2007t | (subsequently improved to 2 in [Chawla et all [2010!]). In 
the context of our paper, several interesting questions arise from the Bayesian Unit-demand Item- 
Pricing Problem. What is the efficiency achieved by the BUPP in one-way games? What is the 
impact of a constant factor approximation for the revenue on the social welfare in one-way games? 

There are also many other directions for future research. It is important to generalize one-way 
games to multiple players. Moreover, there are applications where the dependencies are in both 
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directions, e.g., the restoration of the power and the gas systems considered in Coffrin et al. ( 2012lh 
These applications typically have mnltiple components to restore and the dependencies form an 
acyclic graph. Hence such a mechanism would likely need to consider this internal structure to 
obtain efficient outcomes. 
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